Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 61
Filtrar
1.
Genome Res ; 34(2): 189-200, 2024 Mar 20.
Artigo em Inglês | MEDLINE | ID: mdl-38408788

RESUMO

Recent studies have revealed an unexplored population of long cell-free DNA (cfDNA) molecules in human plasma using long-read sequencing technologies. However, the biological properties of long cfDNA molecules (>500 bp) remain largely unknown. To this end, we have investigated the origins of long cfDNA molecules from different genomic elements. Analysis of plasma cfDNA using long-read sequencing reveals an uneven distribution of long molecules from across the genome. Long cfDNA molecules show overrepresentation in euchromatic regions of the genome, in sharp contrast to short DNA molecules. We observe a stronger relationship between the abundance of long molecules and mRNA gene expression levels, compared with short molecules (Pearson's r = 0.71 vs. -0.14). Moreover, long and short molecules show distinct fragmentation patterns surrounding CpG sites. Leveraging the cleavage preferences surrounding CpG sites, the combined cleavage ratios of long and short molecules can differentiate patients with hepatocellular carcinoma (HCC) from non-HCC subjects (AUC = 0.87). We also investigated knockout mice in which selected nuclease genes had been inactivated in comparison with wild-type mice. The proportion of long molecules originating from transcription start sites are lower in Dffb-deficient mice but higher in Dnase1l3-deficient mice compared with that of wild-type mice. This work thus provides new insights into the biological properties and potential clinical applications of long cfDNA molecules.


Assuntos
Carcinoma Hepatocelular , Ácidos Nucleicos Livres , Neoplasias Hepáticas , Humanos , Animais , Camundongos , Ácidos Nucleicos Livres/genética , Carcinoma Hepatocelular/genética , Neoplasias Hepáticas/genética , DNA/genética , Genômica , Camundongos Knockout , Endodesoxirribonucleases/genética
2.
Clin Chem ; 69(2): 189-201, 2023 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-36576350

RESUMO

BACKGROUND: Nuclear-derived cell-free DNA (cfDNA) molecules in blood plasma are nonrandomly fragmented, bearing a wealth of information related to tissues of origin. DNASE1L3 (deoxyribonuclease 1 like 3) is an important player in shaping the fragmentation of nuclear-derived cfDNA molecules, preferentially generating molecules with 5 CC dinucleotide termini (i.e., 5 CC-end motif). However, the fragment end properties of microbial cfDNA and its clinical implication remain to be explored. METHODS: We performed end motif analysis on microbial cfDNA fragments in plasma samples from patients with sepsis. A sequence context-based normalization method was used to minimize the potential biases for end motif analysis. RESULTS: The end motif profiles of microbial cfDNA appeared to resemble that of nuclear cfDNA (Spearman correlation coefficient: 0.82, P value 0.001). The CC-end motif was the most preferred end motif in microbial cfDNA, suggesting that DNASE1L3 might also play a role in the fragmentation of microbe-derived cfDNA in plasma. Of note, differential end motifs were present between microbial cfDNA originating from infection-causing pathogens (enriched at the CC-end) and contaminating microbial DNA potentially derived from reagents or the environment (nearly random). The use of fragment end signatures allowed differentiation between confirmed pathogens and contaminating microbes, with an area under the receiver operating characteristic curve of 0.99. The performance appeared to be superior to conventional analysis based on microbial cfDNA abundance alone. CONCLUSIONS: The use of fragmentomic features could facilitate the differentiation of underlying contaminating microbes from true pathogens in sepsis. This work demonstrates the potential usefulness of microbial cfDNA fragmentomics in metagenomics analysis.


Assuntos
Ácidos Nucleicos Livres , Sepse , Humanos , DNA/genética , Sepse/diagnóstico , Fragmentação do DNA
3.
Clin Chem ; 69(2): 168-179, 2023 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-36322427

RESUMO

BACKGROUND: Recent studies using single molecule, real-time (SMRT) sequencing revealed a substantial population of analyzable long cell-free DNA (cfDNA) in plasma. Potential clinical utilities of such long cfDNA in pregnancy and cancer have been demonstrated. However, the performance of different long-read sequencing platforms for the analysis of long cfDNA remains unknown. METHODS: Size biases of SMRT sequencing by Pacific Biosciences (PacBio) and nanopore sequencing by Oxford Nanopore Technologies (ONT) were evaluated using artificial mixtures of sonicated human and mouse DNA of different sizes. cfDNA from plasma samples of pregnant women at different trimesters, hepatitis B carriers, and patients with hepatocellular carcinoma were sequenced with the 2 platforms. RESULTS: Both platforms showed biases to sequence longer (1500 bp vs 200 bp) DNA fragments, with PacBio showing a stronger bias (5-fold overrepresentation of long fragments vs 2-fold in ONT). Percentages of cfDNA fragments 500 bp were around 6-fold higher in PacBio compared with ONT. End motif profiles of cfDNA from PacBio and ONT were similar, yet exhibited platform-dependent patterns. Tissue-of-origin analysis based on single-molecule methylation patterns showed comparable performance on both platforms. CONCLUSIONS: SMRT sequencing generated data with higher percentages of long cfDNA compared with nanopore sequencing. Yet, a higher number of long cfDNA fragments eligible for the tissue-of-origin analysis could be obtained from nanopore sequencing due to its much higher throughput. When analyzing the size and end motif of cfDNA, one should be aware of the analytical characteristics and possible biases of the sequencing platforms being used.


Assuntos
Ácidos Nucleicos Livres , Neoplasias Hepáticas , Sequenciamento por Nanoporos , Humanos , Feminino , Gravidez , Animais , Camundongos , Ácidos Nucleicos Livres/genética , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA , DNA/genética
4.
Proc Natl Acad Sci U S A ; 119(44): e2209852119, 2022 11.
Artigo em Inglês | MEDLINE | ID: mdl-36288287

RESUMO

Cell-free DNA (cfDNA) fragmentation patterns contain important molecular information linked to tissues of origin. We explored the possibility of using fragmentation patterns to predict cytosine-phosphate-guanine (CpG) methylation of cfDNA, obviating the use of bisulfite treatment and associated risks of DNA degradation. This study investigated the cfDNA cleavage profile surrounding a CpG (i.e., within an 11-nucleotide [nt] window) to analyze cfDNA methylation. The cfDNA cleavage proportion across positions within the window appeared nonrandom and exhibited correlation with methylation status. The mean cleavage proportion was ∼twofold higher at the cytosine of methylated CpGs than unmethylated ones in healthy controls. In contrast, the mean cleavage proportion rapidly decreased at the 1-nt position immediately preceding methylated CpGs. Such differential cleavages resulted in a characteristic change in relative presentations of CGN and NCG motifs at 5' ends, where N represented any nucleotide. CGN/NCG motif ratios were correlated with methylation levels at tissue-specific methylated CpGs (e.g., placenta or liver) (Pearson's absolute r > 0.86). cfDNA cleavage profiles were thus informative for cfDNA methylation and tissue-of-origin analyses. Using CG-containing end motifs, we achieved an area under a receiver operating characteristic curve (AUC) of 0.98 in differentiating patients with and without hepatocellular carcinoma and enhanced the positive predictive value of nasopharyngeal carcinoma screening (from 19.6 to 26.8%). Furthermore, we elucidated the feasibility of using cfDNA cleavage patterns to deduce CpG methylation at single CpG resolution using a deep learning algorithm and achieved an AUC of 0.93. FRAGmentomics-based Methylation Analysis (FRAGMA) presents many possibilities for noninvasive prenatal, cancer, and organ transplantation assessment.


Assuntos
Ácidos Nucleicos Livres , Neoplasias Hepáticas , Gravidez , Feminino , Humanos , Ácidos Nucleicos Livres/genética , Biomarcadores Tumorais/genética , Metilação de DNA , Neoplasias Hepáticas/genética , Epigênese Genética , DNA/genética , Citosina , Guanina , Nucleotídeos , Fosfatos
5.
Clin Chem ; 68(9): 1151-1163, 2022 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-35587130

RESUMO

BACKGROUND: Analysis of circulating tumor DNA has become increasingly important as a tool for cancer care. However, the focus of previous studies has been on short fragments of DNA. Also, bisulfite sequencing, a conventional approach for methylation analysis, causes DNA degradation, which is not ideal for the assessment of long DNA properties and methylation patterns. This study attempted to overcome such obstacles by single-molecule sequencing. METHODS: Single-molecule real-time (SMRT) sequencing was used to sequence plasma DNA. We performed fragment size and direct methylation analysis for each molecule. A methylation score concerning single-molecule methylation patterns was used for cancer detection. RESULTS: A substantial proportion of plasma DNA was longer than 1 kb with a median of 16% in hepatocellular carcinoma (HCC) patients, hepatitis B virus carriers, and healthy individuals. The longest plasma DNA molecule in the HCC patients was 39.8 kb. Tumoral cell-free DNA (cfDNA) was generally shorter than nontumoral cfDNA. The longest tumoral cfDNA was 13.6 kb. Tumoral cfDNA had lower methylation levels compared with nontumoral cfDNA (median: 59.3% vs 76.9%). We developed and analyzed a metric reflecting single-molecule methylation patterns associated with cancer, named the HCC methylation score. HCC patients displayed significantly higher HCC methylation scores than those without HCC. Interestingly, compared to using short cfDNA (area under the receiver operating characteristic [ROC] curve, AUC: 0.75), the use of long cfDNA molecules greatly enhanced the discriminatory power (AUC: 0.91). CONCLUSIONS: A previously unidentified long cfDNA population was revealed in cancer patients. The presence and direct methylation analysis of these molecules open new possibilities for cancer liquid biopsy.


Assuntos
Carcinoma Hepatocelular , Ácidos Nucleicos Livres , Neoplasias Hepáticas , Biomarcadores Tumorais , Carcinoma Hepatocelular/diagnóstico , Carcinoma Hepatocelular/genética , Ácidos Nucleicos Livres/genética , DNA , Metilação de DNA , Humanos , Neoplasias Hepáticas/diagnóstico , Neoplasias Hepáticas/genética
6.
JCI Insight ; 7(8)2022 04 22.
Artigo em Inglês | MEDLINE | ID: mdl-35451374

RESUMO

Cell-free extrachromosomal circular DNA (eccDNA) as a distinct topological form from linear DNA has recently gained increasing research interest, with possible clinical applications as a class of biomarkers. In this study, we aimed to explore the relationship between nucleases and eccDNA characteristics in plasma. By using knockout mouse models with deficiencies in deoxyribonuclease 1 (DNASE1) or deoxyribonuclease 1 like 3 (DNASE1L3), we found that cell-free eccDNA in Dnase1l3-/- mice exhibited larger size distributions than that in wild-type mice. Such size alterations were not found in tissue eccDNA of either Dnase1-/- or Dnase1l3-/- mice, suggesting that DNASE1L3 could digest eccDNA extracellularly but did not seem to affect intracellular eccDNA. Using a mouse pregnancy model, we observed that in Dnase1l3-/- mice pregnant with Dnase1l3+/- fetuses, the eccDNA in the maternal plasma was shorter compared with that of Dnase1l3-/- mice carrying Dnase1l3-/- fetuses, highlighting the systemic effects of circulating fetal DNASE1L3 degrading the maternal eccDNA extracellularly. Furthermore, plasma eccDNA in patients with DNASE1L3 mutations also exhibited longer size distributions than that in healthy controls. Taken together, this study provided a hitherto missing link between nuclease activity and the biological manifestations of eccDNA in plasma, paving the way for future biomarker development of this special form of DNA molecules.


Assuntos
DNA , Feto , Animais , DNA Circular/genética , Desoxirribonucleases/genética , Endodesoxirribonucleases/genética , Endodesoxirribonucleases/metabolismo , Feminino , Feto/metabolismo , Humanos , Camundongos , Camundongos Knockout , Gravidez
7.
NPJ Genom Med ; 7(1): 14, 2022 Feb 23.
Artigo em Inglês | MEDLINE | ID: mdl-35197474

RESUMO

Single-stranded ends of double-stranded DNA (jagged ends) are more abundant in urinary DNA than in plasma DNA. However, the lengths of jagged ends in urinary DNA remained undetermined, as a previous method used for urinary DNA jagged end sequencing analysis (Jag-seq) relied on unmethylation at CpG sites, limiting the resolution. Here, we performed high-resolution Jag-seq analysis using methylation at non-CpG cytosine sites, allowing determination of exact length of jagged ends. The urinary DNA bore longer jagged ends (~26-nt) than plasma DNA (~17-nt). The jagged end length distribution displayed 10-nt periodicities in urinary DNA, which were much less observable in plasma DNA. Amplitude of the 10-nt periodicities increased in patients with renal cell carcinoma. Heparin treatment of urine diminished the 10-nt periodicities. The urinary DNA jagged ends often extended into nucleosomal cores, suggesting potential interactions with histones. This study has thus advanced our knowledge of jagged ends in urine DNA.

8.
Proc Natl Acad Sci U S A ; 118(50)2021 12 14.
Artigo em Inglês | MEDLINE | ID: mdl-34873045

RESUMO

In the field of circulating cell-free DNA, most of the studies have focused on short DNA molecules (e.g., <500 bp). The existence of long cell-free DNA molecules has been poorly explored. In this study, we demonstrated that single-molecule real-time sequencing allowed us to detect and analyze a substantial proportion of long DNA molecules from both fetal and maternal sources in maternal plasma. Such molecules were beyond the size detection limits of short-read sequencing technologies. The proportions of long cell-free DNA molecules in maternal plasma over 500 bp were 15.5%, 19.8%, and 32.3% for the first, second, and third trimesters, respectively. The longest fetal-derived plasma DNA molecule observed was 23,635 bp. Long plasma DNA molecules demonstrated predominance of A or G 5' fragment ends. Pregnancies with preeclampsia demonstrated a reduction in long maternal plasma DNA molecules, reduced frequencies for selected 5' 4-mer end motifs ending with G or A, and increased frequencies for selected motifs ending with T or C. Finally, we have developed an approach that employs the analysis of methylation patterns of the series of CpG sites on a long DNA molecule for determining its tissue origin. This approach achieved an area under the curve of 0.88 in differentiating between fetal and maternal plasma DNA molecules, enabling the determination of maternal inheritance and recombination events in the fetal genome. This work opens up potential clinical utilities of long cell-free DNA analysis in maternal plasma including noninvasive prenatal testing of monogenic diseases and detection/monitoring of pregnancy-associated disorders such as preeclampsia.


Assuntos
Ácidos Nucleicos Livres/sangue , Ácidos Nucleicos Livres/genética , Adulto , Cromossomos/genética , Simulação por Computador , Feminino , Feto , Humanos , Gravidez , Imagem Individual de Molécula
9.
Microbiome ; 9(1): 205, 2021 10 15.
Artigo em Inglês | MEDLINE | ID: mdl-34649602

RESUMO

BACKGROUND: Long-read sequencing has shown its tremendous potential to address genome assembly challenges, e.g., achieving the first telomere-to-telomere assembly of a gapless human chromosome. However, many issues remain unresolved when leveraging error-prone long reads to characterize high-complexity metagenomes, for instance, complete/high-quality genome reconstruction from highly complex systems. RESULTS: Here, we developed an iterative haplotype-resolved hierarchical clustering-based hybrid assembly (HCBHA) approach that capitalizes on a hybrid (error-prone long reads and high-accuracy short reads) sequencing strategy to reconstruct (near-) complete genomes from highly complex metagenomes. Using the HCBHA approach, we first phase short and long reads from the highly complex metagenomic dataset into different candidate bacterial haplotypes, then perform hybrid assembly of each bacterial genome individually. We reconstructed 557 metagenome-assembled genomes (MAGs) with an average N50 of 574 Kb from a deeply sequenced, highly complex activated sludge (AS) metagenome. These high-contiguity MAGs contained 14 closed genomes and 111 high-quality (HQ) MAGs including full-length rRNA operons, which accounted for 61.1% of the microbial community. Leveraging the near-complete genomes, we also profiled the metabolic potential of the AS microbiome and identified 2153 biosynthetic gene clusters (BGCs) encoded within the recovered AS MAGs. CONCLUSION: Our results established the feasibility of an iterative haplotype-resolved HCBHA approach to reconstruct (near-) complete genomes from highly complex ecosystems, providing new insights into "complete metagenomics". The retrieved high-contiguity MAGs illustrated that various biosynthetic gene clusters (BGCs) were harbored in the AS microbiome. The high diversity of BGCs highlights the potential to discover new natural products biosynthesized by the AS microbial community, aside from the traditional function (e.g., organic carbon and nitrogen removal) in wastewater treatment. Video Abstract.


Assuntos
Microbiota , Esgotos , Genoma Bacteriano/genética , Humanos , Metagenoma/genética , Metagenômica , Microbiota/genética
10.
Clin Chem ; 67(11): 1492-1502, 2021 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-34463757

RESUMO

BACKGROUND: Human plasma contains RNA transcripts released by multiple cell types within the body. Single-cell transcriptomic analysis allows the cellular origin of circulating RNA molecules to be elucidated at high resolution and has been successfully utilized in the pregnancy context. We explored the application of a similar approach to develop plasma RNA markers for cancer detection. METHODS: Single-cell RNA sequencing was performed to decipher transcriptomic profiles of single cells from hepatocellular carcinoma (HCC) samples. Cell-type-specific transcripts were identified and used for deducing the cell-type-specific gene signature (CELSIG) scores of plasma RNA from patients with and without HCC. RESULTS: Six major cell clusters were identified, including hepatocyte-like, cholangiocyte-like, myofibroblast, endothelial, lymphoid, and myeloid cell clusters based on 4 HCC tumor tissues as well as their paired adjacent nontumoral tissues. The CELSIG score of hepatocyte-like cells was significantly increased in preoperative plasma RNA samples of patients with HCC (n = 14) compared with non-HCC participants (n = 49). The CELSIG score of hepatocyte-like cells declined in plasma RNA samples of patients with HCC within 3 days after tumor resection. Compared with the discriminating power between patients with and without HCC using the abundance of ALB transcript in plasma [area under curve (AUC) 0.72)], an improved performance (AUC: 0.84) was observed using the CELSIG score. The hepatocyte-specific transcript markers in plasma RNA were further validated by ddPCR assays. The CELSIG scores of hepatocyte-like cell and cholangiocyte trended with patients' survival. CONCLUSIONS: The combination of single-cell transcriptomic analysis and plasma RNA sequencing represents an approach for the development of new noninvasive cancer markers.


Assuntos
Carcinoma Hepatocelular , Neoplasias Hepáticas , Biomarcadores Tumorais , Carcinoma Hepatocelular/diagnóstico , Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/patologia , Humanos , Biópsia Líquida , Neoplasias Hepáticas/diagnóstico , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/patologia , RNA/genética , Análise de Sequência de RNA
11.
Elife ; 102021 03 23.
Artigo em Inglês | MEDLINE | ID: mdl-33752803

RESUMO

We developed genetic-epigenetic tissue mapping (GETMap) to determine the tissue composition of plasma DNA carrying genetic variants not present in the constitutional genome through comparing their methylation profiles with relevant tissues. We validated this approach by showing that, in pregnant women, circulating DNA carrying fetal-specific alleles was entirely placenta-derived. In lung transplant recipients, we showed that, at 72 hr after transplantation, the lung contributed only a median of 17% to the plasma DNA carrying donor-specific alleles, and hematopoietic cells contributed a median of 78%. In hepatocellular cancer patients, the liver was identified as the predominant source of plasma DNA carrying tumor-specific mutations. In a pregnant woman with lymphoma, plasma DNA molecules carrying cancer mutations and fetal-specific alleles were accurately shown to be derived from the lymphocytes and placenta, respectively. Analysis of tissue origin for plasma DNA carrying genetic variants is potentially useful for noninvasive prenatal testing, transplantation monitoring, and cancer screening.


Assuntos
DNA/sangue , Epigenômica/métodos , Neoplasias/genética , Transplante de Órgãos/métodos , Diagnóstico Pré-Natal/métodos , Adulto , Idoso , Biomarcadores Tumorais/genética , Carcinoma Hepatocelular/genética , DNA/genética , Metilação de DNA , DNA de Neoplasias/sangue , DNA de Neoplasias/genética , Epigênese Genética , Feminino , Feto/metabolismo , Variação Genética , Humanos , Neoplasias Hepáticas/genética , Linfoma/genética , Masculino , Pessoa de Meia-Idade , Neoplasias/sangue , Placenta/metabolismo , Gravidez , Análise de Sequência de DNA/métodos
12.
Clin Chem ; 67(5): 788-796, 2021 04 29.
Artigo em Inglês | MEDLINE | ID: mdl-33615350

RESUMO

BACKGROUND: Although the characterization of cell-free extrachromosomal circular DNA (eccDNA) has gained much research interest, the methylation status of these molecules is yet to be elucidated. We set out to compare the methylation densities of plasma eccDNA of maternal and fetal origins, and between small and large molecules. The clearance of fetal eccDNA from maternal circulation was also investigated. METHODS: We developed a sequencing protocol for eccDNA methylation analysis using tagmentation and enzymatic conversion approaches. A restriction enzyme-based approach was applied to verify the tagmentation results. The efficiency of cell-free fetal eccDNA clearance was investigated by fetal eccDNA fraction evaluations at various postpartum time points. RESULTS: The methylation densities of fetal eccDNA (median: 56.3%; range: 40.5-67.6%) were lower than the maternal eccDNA (median: 66.7%; range: 56.5-75.7%) (P = 0.02, paired t-test). In addition, eccDNA molecules from the smaller peak cluster (180-230 bp) were of lower methylation levels than those from the larger peak cluster (300-450 bp). Both of these findings were confirmed using the restriction enzyme approach. We also observed comparable methylation densities between linear and eccDNA of both maternal and fetal origins. The average half-lives of fetal linear and eccDNA in the maternal blood were 30.2 and 29.7 min, respectively. CONCLUSIONS: We found that fetal eccDNA in plasma was relatively hypomethylated compared to the maternal eccDNA. The methylation densities of eccDNA were positively correlated with their sizes. In addition, fetal eccDNA was found to be rapidly cleared from the maternal blood after delivery, similar to fetal linear DNA.


Assuntos
DNA Circular , DNA , DNA/genética , Metilação de DNA , Feminino , Feto , Humanos , Metilação , Plasma
13.
Clin Chem ; 67(4): 621-630, 2021 03 31.
Artigo em Inglês | MEDLINE | ID: mdl-33604652

RESUMO

BACKGROUND: Double-stranded DNA in plasma is known to carry single-stranded ends, called jagged ends. Plasma DNA jagged ends are biomarkers for pathophysiologic states such as pregnancy and cancer. It remains unknown whether urinary cell-free DNA (cfDNA) molecules have jagged ends. METHODS: Jagged ends of cfDNA were detected by incorporating unmethylated cytosines during a DNA end-repair process, followed by bisulfite sequencing. Incorporation of unmethylated cytosines during the repair of the jagged ends lowered the apparent methylation levels measured by bisulfite sequencing and were used to calculate a jagged end index. This approach is called jagged end analysis by sequencing. RESULTS: The jagged end index of urinary cfDNA was higher than that of plasma DNA. The jagged end index profile of plasma DNA displayed several strongly oscillating major peaks at intervals of approximately 165 bp (i.e., nucleosome size) and weakly oscillating minor peaks with periodicities of approximately 10 bp. In contrast, the urinary DNA jagged end index profile showed weakly oscillating major peaks but strongly oscillating minor peaks. The jagged end index was generally higher in nucleosomal linker DNA regions. Patients with bladder cancer (n = 46) had lower jagged end indexed of urinary DNA than participants without bladder cancer (n = 39). The area under the curve for differentiating between patients with and without bladder cancer was 0.83. CONCLUSIONS: Jagged ends represent a property of urinary cfDNA. The generation of jagged ends might be related to nucleosomal structures, with enrichment in linker DNA regions. Jagged ends of urinary DNA could potentially serve as a new biomarker for bladder cancer detection.


Assuntos
Ácidos Nucleicos Livres , Neoplasias da Bexiga Urinária , Biomarcadores Tumorais/genética , Ácidos Nucleicos Livres/genética , DNA/genética , Metilação de DNA , Estudos de Viabilidade , Feminino , Humanos , Nucleossomos , Gravidez , Análise de Sequência de DNA , Neoplasias da Bexiga Urinária/diagnóstico , Neoplasias da Bexiga Urinária/genética
14.
Proc Natl Acad Sci U S A ; 118(5)2021 02 02.
Artigo em Inglês | MEDLINE | ID: mdl-33495335

RESUMO

5-Methylcytosine (5mC) is an important type of epigenetic modification. Bisulfite sequencing (BS-seq) has limitations, such as severe DNA degradation. Using single molecule real-time sequencing, we developed a methodology to directly examine 5mC. This approach holistically examined kinetic signals of a DNA polymerase (including interpulse duration and pulse width) and sequence context for every nucleotide within a measurement window, termed the holistic kinetic (HK) model. The measurement window of each analyzed double-stranded DNA molecule comprised 21 nucleotides with a cytosine in a CpG site in the center. We used amplified DNA (unmethylated) and M.SssI-treated DNA (methylated) (M.SssI being a CpG methyltransferase) to train a convolutional neural network. The area under the curve for differentiating methylation states using such samples was up to 0.97. The sensitivity and specificity for genome-wide 5mC detection at single-base resolution reached 90% and 94%, respectively. The HK model was then tested on human-mouse hybrid fragments in which each member of the hybrid had a different methylation status. The model was also tested on human genomic DNA molecules extracted from various biological samples, such as buffy coat, placental, and tumoral tissues. The overall methylation levels deduced by the HK model were well correlated with those by BS-seq (r = 0.99; P < 0.0001) and allowed the measurement of allele-specific methylation patterns in imprinted genes. Taken together, this methodology has provided a system for simultaneous genome-wide genetic and epigenetic analyses.


Assuntos
Citosina/metabolismo , Metilação de DNA/genética , Análise de Sequência de DNA , Imagem Individual de Molécula , Animais , Sequência de Bases , DNA/metabolismo , Impressão Genômica , Humanos , Camundongos , Modelos Biológicos
15.
Prenat Diagn ; 41(3): 368-375, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-33140416

RESUMO

OBJECTIVES: Due to the maternally-inherited nature of mitochondrial DNA (mtDNA), there is a lack of information regarding fetal mtDNA in the plasma of pregnant women. We aim to explore the presence and topologic forms of circulating fetal and maternal mtDNA molecules in surrogate pregnancies. METHODS: Genotypic differences between fetal and surrogate maternal mtDNA were used to identify the fetal and maternal mtDNA molecules in plasma. Plasma samples were obtained from the surrogate pregnant mothers. Using cleavage-end signatures of BfaI restriction enzyme, linear and circular mtDNA molecules in maternal plasma could be differentiated. RESULTS: Fetal-derived mtDNA molecules were mainly linear (median: 88%; range: 80%-96%), whereas approximately half of the maternal-derived mtDNA molecules were circular (median: 51%; range: 42%-60%). The fetal DNA fraction of linear mtDNA was lower (median absolute difference: 9.8%; range: 1.1%-27%) than that of nuclear DNA (median: 20%; range: 9.7%-35%). The fetal-derived linear mtDNA molecules were shorter than the maternal-derived ones. CONCLUSION: Fetal mtDNA is present in maternal plasma, and consists mainly of linear molecules. Surrogate pregnancies represent a valuable clinical scenario for exploring the biology and potential clinical applications of circulating mtDNA, for example, for pregnancies conceived following mitochondrial replacement therapy.


Assuntos
DNA Mitocondrial/genética , Feto/anormalidades , Mães Substitutas/estatística & dados numéricos , Adulto , DNA Mitocondrial/sangue , Feminino , Feto/fisiopatologia , Humanos , Herança Materna/genética , Moscou/epidemiologia , Plasma/microbiologia , Gravidez
16.
Microbiome ; 8(1): 155, 2020 11 06.
Artigo em Inglês | MEDLINE | ID: mdl-33158461

RESUMO

BACKGROUND: Genome-centric approaches are widely used to investigate microbial compositions, dynamics, ecology, and interactions within various environmental systems. Hundreds or even thousands of genomes could be retrieved in a single study contributed by the cost-effective short-read sequencing and developed assembly/binning pipelines. However, conventional binning methods usually yield highly fragmented draft genomes that limit our ability to comprehensively understand these microbial communities. Thus, to leverage advantage of both the long and short reads to retrieve more complete genomes from environmental samples is a must-do task to move this direction forward. RESULTS: Here, we used an iterative hybrid assembly (IHA) approach to reconstruct 49 metagenome-assembled genomes (MAGs), including 27 high-quality (HQ) and high-contiguity (HC) genomes with contig number ≤ 5, eight of which were circular finished genomes from a partial-nitritation anammox (PNA) reactor. These 49 recovered MAGs (43 MAGs encoding full-length rRNA, average N50 of 2.2 Mbp), represented the majority (92.3%) of the bacterial community. Moreover, the workflow retrieved HQ and HC MAGs even with an extremely low coverage (relative abundance < 0.1%). Among them, 34 MAGs could not be assigned to the genus level, indicating the novelty of the genomes retrieved using the IHA method proposed in this study. Comparative analysis of HQ MAG pairs reconstructed using two methods, i.e., hybrid and short reads only, revealed that identical genes in the MAG pairs represented 87.5% and 95.5% of the total gene inventory of hybrid and short reads only assembled MAGs, respectively. In addition, the first finished anammox genome of the genus Ca. Brocadia reconstructed revealed that there were two identical hydrazine synthase (hzs) genes, providing the exact gene copy number of this crucial phylomarker of anammox at the genome level. CONCLUSIONS: Our results showcased the high-quality and high-contiguity genome retrieval performance and demonstrated the feasibility of complete genome reconstruction using the IHA workflow from the enrichment system. These (near-) complete genomes provided a high resolution of the microbial community, which might help to understand the bacterial repertoire of anammox-associated systems. Combined with other validation experiments, the workflow can enable a detailed view of the anammox or other similar enrichment systems. Video Abstract.


Assuntos
Compostos de Amônio/metabolismo , Bactérias/genética , Bactérias/metabolismo , Crescimento Quimioautotrófico , Genoma Bacteriano/genética , Metagenoma/genética , Microbiota/genética , Nitritos/metabolismo , Anaerobiose , Oxirredução
17.
Genome Res ; 30(8): 1144-1153, 2020 08.
Artigo em Inglês | MEDLINE | ID: mdl-32801148

RESUMO

Cell-free DNA in plasma has been used for noninvasive prenatal testing and cancer liquid biopsy. The physical properties of cell-free DNA fragments in plasma, such as fragment sizes and ends, have attracted much recent interest, leading to the emerging field of cell-free DNA fragmentomics. However, one aspect of plasma DNA fragmentomics as to whether double-stranded plasma molecules might carry single-stranded ends, termed a jagged end in this study, remains underexplored. We have developed two approaches for investigating the presence of jagged ends in a plasma DNA pool. These approaches utilized DNA end repair to introduce differential methylation signals between the original sequence and the jagged ends, depending on whether unmethylated or methylated cytosines were used in the DNA end-repair procedure. The majority of plasma DNA molecules (87.8%) were found to bear jagged ends. The jaggedness varied according to plasma DNA fragment sizes and appeared to be in association with nucleosomal patterns. In the plasma of pregnant women, the jaggedness of fetal DNA molecules was higher than that of the maternal counterparts. The jaggedness of plasma DNA correlated with the fetal DNA fraction. Similarly, in the plasma of cancer patients, tumor-derived DNA molecules in patients with hepatocellular carcinoma showed an elevated jaggedness compared with nontumoral DNA. In mouse models, knocking out of the Dnase1 gene reduced jaggedness, whereas knocking out of the Dnase1l3 gene enhanced jaggedness. Hence, plasma DNA jagged ends represent an intrinsic property of plasma DNA and provide a link between nuclease activities and the fragmentation of plasma DNA.


Assuntos
Ácidos Nucleicos Livres/sangue , Fragmentação do DNA , Metilação de DNA/genética , DNA/sangue , DNA/genética , Animais , Carcinoma Hepatocelular/genética , Ácidos Nucleicos Livres/genética , Reparo do DNA por Junção de Extremidades/genética , Endodesoxirribonucleases/genética , Feminino , Humanos , Neoplasias Hepáticas/genética , Camundongos , Camundongos Knockout , Nucleossomos/genética , Gravidez
18.
Clin Chem ; 66(4): 598-605, 2020 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-32191318

RESUMO

BACKGROUND: Nasopharyngeal carcinoma (NPC) is strongly associated with Epstein-Barr virus (EBV) infection. Plasma EBV DNA is a validated screening tool for NPC. In screening, there are some individuals who do not have NPC but carry EBV DNA in plasma. Currently it is not known from screening if there may be any genotypic differences in EBV isolates from NPC and non-NPC subjects. Also, low concentrations of EBV DNA in plasma could pose challenge to such EBV genotypic analysis through plasma DNA sequencing. METHODS: In a training dataset comprised of plasma DNA sequencing data of NPC and non-NPC subjects, we studied the difference in the EBV single nucleotide variant (SNV) profiles between the two groups. The most differentiating SNVs across the EBV genome were identified. We proposed an NPC risk score to be derived from the genotypic patterns over these SNV sites. We subsequently analyzed the NPC risk scores in a testing set. RESULTS: A total of 661 significant SNVs across the EBV genome were identified from the training set. In the testing set, NPC plasma samples were shown to have high NPC risk scores, which suggested the presence of NPC-associated EBV SNV profiles. Among the non-NPC samples, there was a wide range of NPC risk scores. These results support the presence of diverse SNV profiles of EBV isolates from non-NPC subjects. CONCLUSION: EBV genotypic analysis is feasible through plasma DNA sequencing. The NPC risk score may be used to inform the cancer risk based on the EBV genome-wide SNV profile.


Assuntos
DNA Viral/sangue , Infecções por Vírus Epstein-Barr/virologia , Herpesvirus Humano 4/genética , Carcinoma Nasofaríngeo/virologia , Infecções por Vírus Epstein-Barr/sangue , Infecções por Vírus Epstein-Barr/complicações , Genoma Viral , Genótipo , Humanos , Modelos Biológicos , Carcinoma Nasofaríngeo/sangue , Carcinoma Nasofaríngeo/etiologia , Polimorfismo de Nucleotídeo Único , Fatores de Risco , Análise de Sequência de DNA
19.
Cancer Discov ; 10(5): 664-673, 2020 05.
Artigo em Inglês | MEDLINE | ID: mdl-32111602

RESUMO

Plasma DNA fragmentomics is an emerging area of research covering plasma DNA sizes, end points, and nucleosome footprints. In the present study, we found a significant increase in the diversity of plasma DNA end motifs in patients with hepatocellular carcinoma (HCC). Compared with patients without HCC, patients with HCC showed a preferential pattern of 4-mer end motifs. In particular, the abundance of plasma DNA motif CCCA was much lower in patients with HCC than in subjects without HCC. The aberrant end motifs were also observed in patients with other cancer types, including colorectal cancer, lung cancer, nasopharyngeal carcinoma, and head and neck squamous cell carcinoma. We further observed that the profile of plasma DNA end motifs originating from the same organ, such as the liver, placenta, and hematopoietic cells, generally clustered together. The profile of end motifs may therefore serve as a class of biomarkers for liquid biopsy in oncology, noninvasive prenatal testing, and transplantation monitoring. SIGNIFICANCE: Plasma DNA molecules originating from the liver, HCC and other cancers, placenta, and hematopoietic cells each harbor a set of characteristic plasma DNA end motifs. Such markers carry tissue-of-origin information and represent a new class of biomarkers in the nascent field of fragmentomics.This article is highlighted in the In This Issue feature, p. 627.


Assuntos
DNA/sangue , Neoplasias Hepáticas/genética , Transplante de Fígado/métodos , Feminino , Humanos , Gravidez
20.
Proc Natl Acad Sci U S A ; 117(3): 1658-1665, 2020 01 21.
Artigo em Inglês | MEDLINE | ID: mdl-31900366

RESUMO

We explored the presence of extrachromosomal circular DNA (eccDNA) in the plasma of pregnant women. Through sequencing following either restriction enzyme or Tn5 transposase treatment, we identified eccDNA molecules in the plasma of pregnant women. These eccDNA molecules showed bimodal size distributions peaking at ∼202 and ∼338 bp with distinct 10-bp periodicity observed throughout the size ranges within both peaks, suggestive of their nucleosomal origin. Also, the predominance of the 338-bp peak of eccDNA indicated that eccDNA had a larger size distribution than linear DNA in human plasma. Moreover, eccDNA of fetal origin were shorter than the maternal eccDNA. Genomic annotation of the overall population of eccDNA molecules revealed a preference of these molecules to be generated from 5'-untranslated regions (5'-UTRs), exonic regions, and CpG island regions. Two sets of trinucleotide repeat motifs flanking the junctional sites of eccDNA supported multiple possible models for eccDNA generation. This work highlights the topologic analysis of plasma DNA, which is an emerging direction for circulating nucleic acid research and applications.


Assuntos
Ácidos Nucleicos Livres/isolamento & purificação , DNA Circular/isolamento & purificação , Plasma/química , Ácidos Nucleicos Livres/química , Ácidos Nucleicos Livres/genética , DNA Circular/química , DNA Circular/genética , Feminino , Genoma Humano , Hong Kong , Humanos , Teste Pré-Natal não Invasivo , Gravidez
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...